Sensor Synthesis for POMDPs with Reachability Objectives

نویسندگان

Krishnendu Chatterjee

Martin Chmelik

Ufuk Topcu

چکیده

Partially observable Markov decision processes (POMDPs) are widely used in probabilistic planning problems in which an agent interacts with an environment using noisy and imprecise sensors. We study a setting in which the sensors are only partially defined and the goal is to synthesize"weakest"additional sensors, such that in the resulting POMDP, there is a small-memory policy for the agent that almost-surely (with probability~1) satisfies a reachability objective. We show that the problem is NP-complete, and present a symbolic algorithm by encoding the problem into SAT instances. We illustrate trade-offs between the amount of memory of the policy and the number of additional sensors on a simple example. We have implemented our approach and consider three classical POMDP examples from the literature, and show that in all the examples the number of sensors can be significantly decreased (as compared to the existing solutions in the literature) without increasing the complexity of the policies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal Cost Almost-Sure Reachability in POMDPs

We consider partially observable Markov decision processes (POMDPs) with a set of target states and every transition is associated with an integer cost. The optimization objective we study asks to minimize the expected total cost till the target set is reached, while ensuring that the target set is reached almost-surely (with probability 1). We show that for integer costs approximating the opti...

متن کامل

Monte Carlo Sampling Methods for Approximating Interactive POMDPs

Partially observable Markov decision processes (POMDPs) provide a principled framework for sequential planning in uncertain single agent settings. An extension of POMDPs to multiagent settings, called interactive POMDPs (I-POMDPs), replaces POMDP belief spaces with interactive hierarchical belief systems which represent an agent’s belief about the physical world, about beliefs of other agents, ...

متن کامل

Compositional Synthesis of Reactive Controllers for Multi-agent Systems

In this paper we consider the controller synthesis problem for multi-agent systems that consist of a set of controlled and uncontrolled agents. Controlled agents may need to cooperate with each other and react to the actions of uncontrolled agents in order to fulfill their objectives. Besides, the controlled agents may be imperfect, i.e., only partially observe their environment, for example du...

متن کامل

Computational Techniques for Reachability Analysis of Partially Observable Discrete Time Stochastic Hybrid Systems

Reachability analysis of hybrid systems has been used as a safety verification tool to assess offline whether the state of a system is capable of remaining within a designated safe region for a given time horizon. Although it has been applied to stochastic hybrid systems, little work has been done on the equally important problem of reachability under incomplete or noisy measurements of the sta...

متن کامل

Networked Distributed POMDPs: A Synthesis of Distributed Constraint Optimization and POMDPs

In many real-world multiagent applications such as distributed sensor nets, a network of agents is formed based on each agent’s limited interactions with a small number of neighbors. While distributed POMDPs capture the real-world uncertainty in multiagent domains, they fail to exploit such locality of interaction. Distributed constraint optimization (DCOP) captures the locality of interaction ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1710.00675 شماره

صفحات -

تاریخ انتشار 2017

Sensor Synthesis for POMDPs with Reachability Objectives

نویسندگان

چکیده

منابع مشابه

Optimal Cost Almost-Sure Reachability in POMDPs

Monte Carlo Sampling Methods for Approximating Interactive POMDPs

Compositional Synthesis of Reactive Controllers for Multi-agent Systems

Computational Techniques for Reachability Analysis of Partially Observable Discrete Time Stochastic Hybrid Systems

Networked Distributed POMDPs: A Synthesis of Distributed Constraint Optimization and POMDPs

عنوان ژورنال:

اشتراک گذاری